Improving Taxonomic Name Finding in the Biodiversity Heritage Library
- Publication date
- 2020-09-18
- Usage
- https://creativecommons.org/share-your-work/public-domain/cc0/
- Topics
- taxonomic name identification, OCR, optical character recognition
- Publisher
- Pensoft Publishers
- Collection
- biodiversity
- Contributor
- Pensoft Publishers
- Language
- English
- Rights
- https://biodiversitylibrary.org/permissions
- Rights-holder
- Copyright held by individual article author(s).
- Volume
- 4
- Item Size
- 787.2K
- Abstract
- As the world’s largest open access digital library for biodiversity literature and archives, the Biodiversity Heritage Library (BHL) provides access to over a quarter-million volumes of natural history literature to researchers around the world. One of its services is to index taxonomic names in the collection to allow researchers to locate publications about specific taxa. The Global Names Architecture (GNA) is a system of web services to register, find, index, check and organize biological scientific names. GNA recently developed a new Name Finding algorithm and tool that has been integrated with BHL to improve taxonomic name searches within BHL. In our presentation, we will discuss a brief history of name finding in BHL, development of the Name Finding algorithm, results from implementing the algorithm, and challenges that still await us in the realm of taxonomic name finding in BHL.
- Addeddate
- 2025-06-09 19:41:33
- Bhl_virtual_titleid
- 210882
- Bhl_virtual_volume
- v.4 (2020)
- Call number
- 10_3897_biss_4_58482
- Foldoutcount
- 0
- Genre
- article
- Identifier
- improvingtaxono4rich
- Identifier-ark
- ark:/13960/s2z3434xgrs
- Identifier-bib
- 10_3897_biss_4_58482
- Identifier-doi
- 10.3897/biss.4.58482
- Ocr
- tesseract 5.3.0-6-g76ae
- Ocr_detected_lang
- en
- Ocr_detected_lang_conf
- 1.0000
- Ocr_detected_script
- Latin
- Ocr_detected_script_conf
- 1.0000
- Ocr_module_version
- 0.0.21
- Ocr_parameters
- -l eng
- Page_number_confidence
- 0
- Page_number_module_version
- 1.0.5
- Page_range
- 1-2
- Pages
- 2
- Pdf_degraded
- invalid-jp2-headers
- Pdf_module_version
- 0.0.25
- Possible copyright status
- In copyright. Digitized with the permission of the rights holder.
- Ppi
- 300
- Source
- Biodiversity Information Science and Standards 4
- Year
- 2020
IN COLLECTIONS
Biodiversity Heritage Library
Uploaded by Smithsonian Libraries and Archives
Open Library